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7. Cause and Effect 

BY E. CUYLER HAMMOND 


Causation 

THE EVENTUAL AIM OF EPIDEMIOLOGIC RESEARCH IS TO 
discover means by which conditions may be altered in such a 
way as to lower disease incidence and mortality rates or at least 
to limit their rise. Thus it becomes a search for causative factors. 
This chapter is concerned with methods of approach in attempt¬ 
ing to establish cause and effect relationships in this field of 
science. 

The cause of an effect has sometimes been defined as a single 
factor which antecedes, which is necessary, and which is suffi¬ 
cient to produce the effect. Clearly this definition is inadequate 
for the study of biologic phenomena, which are produced by a 
complex pattern of environmental conditions interacting with 
the highly complex and variable make-up of living organisms. 
For the production of any effect there must be at least two 
factors which come together or one of which acts upon the other. 
In biologic phenomena the factors are usually manifold. 

Wolf” has defined the cause of a certain effect as “that totality 
of conditions that is sufficient to produce it.” He further states 
that “as a rule a cause is complex — it consists of a number of 
conditions each of which is only a part of the cause.” 

It is frequently assumed that the theory of cause and effect 
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THE BIOLOGIC EFFECTS OF TOBACCO 

must lie based upon the acceptance of the concept of physical 
determinism. That is, it is assumed that at any time precisely tile 
same combination of conditions occurs, then precisely the same 
ellect will follow, Fisher 1- ' maintains that an indeterministic view 
of causation has many advantages, particularly in the biologic 
sciences. He says: 

Biologists, perhaps, as this view becomes more familiar, will 
be more inclined to elaborate independently concepts appropri¬ 
ate to their own field of study, and be less inclined to assume 
that all improvements in method must be those introduced from 
the “exact” sciences of physics and chemistry. 

It is a statistical principle that averages arc highly stable when 
based upon observations of enormous numbers of independent 
units. It is for this reason that physical and chemical experiments 
involving billions of atoms may be reproduced with apparent 
exactitude. When the numbers are small, the average is no longer 
so stable and varies somewhat from one experiment to the next. 
The reactions of a single particle may be unpredictable. There¬ 
fore, in modem physics, predictions are often made not in exact 
terms but in terms of probability. Biologists were faced with this 
problem at a much earlier date, for two reasons: (1) they are 
usually forced to work with relatively small numbers of indi¬ 
vidual units (i.e., living organisms); and (2) living organisms 
are variable. Because of individual variability in reactions to a 
given set of environmental conditions (and because it is not 
always possible to control all environmental conditions), it is 
frequently necessary to express findings in statistical terms (e.g., 
averages, percentages, and measures of variability). The search 
for cause and effect relationships then usually takes the form of 
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CAUSE AND EFFECT 

comparing groups of similar individuals that have been exposed 
to different environmental conditions, or groups of genetically 
different individuals that have been exposed to the same environ¬ 
mental conditions. Such differences ns appear between groups 
must then be tested to determine whether they are statistically 
significant before any attempt is made to draw conclusions from 
them. 

For the purposes of this discussion, let us define a cause of 
a disease as a totality of conditions that is sufficient to produce it. 
Then let us define a causative factor as any one of the conditions 
which taken together make up a cause. Most diseases have been 
defined in such a way that a specific causative factor (e.g., a 
specific infectious agent or a specific dietary deficiency) is neces¬ 
sary for the production of a specific disease. Cancer and a few 
other diseases are not so defined at the present time. For ex¬ 
ample, carcinoma can be produced experimentally by a number 
of different chemical substances, so no one of them can be called 
a necessary factor. Whether or not a specific factor is necessary 
by definition, no single condition is sufficient in itself for the 
production of a disease since other conditions must also be satis¬ 
fied. For example, an infectious disease does not occur unless the 
infectious agent is transmitted to a susceptible host. Any one of 
several different modes of transmission may act as a causative 
factor, and it is essential that at least one of them be operative 
to produce a certain disease; but no single one of them alone is 
necessary. Likewise, a number of different independent factors 
may influence susceptibility. Any one of these may act as a 
causative factor in a particular case. Since a cause of a disease 
has been defined as a totality of conditions sufficient to produce 
it, and since most diseases may be produced by a number of 
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THE BIOLOGIC EFFECTS OF TOBACCO 

different specific sets nf conditions, it follows that most diseases 
have a multiplicity of causes. 

In some instances a causative factor may have an all or none 
eflect. In other words, all other conditions being satisfied, the 
presence or absence of one particular factor determines whether 
01 not the effect will be produced. More usually, a causative 
factor acts quantitatively. For example, the probability that 
cancer will develop when a carcinogenic agent is painted on the 
skin of a mouse depends, among other things, on the dosage 
and how long the skin painting is continued. The probability 
that an individual will develop tuberculosis depends in part 
upon the degree to which he is exposed to patients with this 
disease. 

A tremendous number and variety of conditions occur in 
nature, some conditions being independent of each other and 
others being associated, The simultaneous or sequential occur¬ 
rence of a set of conditions such as to cause a particular disease 
in a particular individual is largely a matter of chance. This 
being the case, the presence of any condition which may enter 
into a causative combination increases the probability that the 
disease will occur. For example, under certain conditions the 
presence of a swamp may increase the probability of the occur¬ 
rence of malaria (i.e., increase malaria incidence rates), although 
a swamp is not a necessary factor for the causation of this 
disease. Therefore, a causative factor may be thought of as any 
condition which, in a given situation, increases the probability 
that a specified event will occur. Though present, a potential 
causative factor may or may not be operative in any particular 
instance. The relative importance of any one factor depends 
upon the total situation. 
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What is suggested here is a statistical or probability approach 
to the problem of cause and effect relationships. As Fisher has 
pointed out, this is particularly suitable in the hiologic sciences. 
Those situations in which a single factor appears to have an all 
or none effect are special cases where the probability of the 
occurrence of the event is either 0 or 1. 

Artificial vs. Natural Situations 

The concept of causality probably originated from a common 
human experience. Man, to a limited degree, possesses the 
ability to create sets of conditions. He has learned by trial and 
error that certain sets of conditions are frequently if not always 
followed by certain events. To the extent that this is so, man, by 
deliberate, willful acts can produce results which he desires. The 
man-created conditions, which arc in a sense artificial, are said 
to be the cause of the subsequent event which is said to be the 
effect. If a set of conditions is artificially created many times and 
some subsequent event occurs invariably or nearly so, then this 
by definition is taken as proof of a cause and effect relationship. 
Both invention and experimental science are outgrowths of this 
experience. 

Events are continually occurring in nature which are quite 
independent of any willful human act. From time immemorial 
man has sought for an explanation. The only rational explana¬ 
tions he has ever devised have been in terms of analogies with 
the cause and effect relationships exhibited in man-created 
situations. In primitive societies, naturally occurring events are 
usually attributed to willful acts on the part of manlike gods. In 
modern times, naturally occurring events are ordinarily at- 
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THE 151 O LOGIC EFFECTS OP TOBACCO 

tributed to antecedent sets of naturally occurring conditions. If 
Fisher’s views are accepted, this does not preclude the possibility 
of a creative element in nature nor the possibility of acts of free 
will on the part of man. 

Epidem iologic R esc arch 

In epidemiologic research, the problem is to discover causa¬ 
tive factors which increase the probability of human disease or 
death and are operative and important under various conditions 
of human life which occur with appreciable frequency. For want 
of a hotter word, we may call these natural conditions; that is, 
conditions which occur independently of any act on the part of 
the investigator. Generally speaking, there are two methods of 
approach. One is to observe the natural conditions under which 
disease occurs in man. The other is for the investigator to create 
a set of conditions and then observe subsequent events. These 
two approaches are supplementary to each other and usually 
both are employed. Whichever approach is used, the investigator 
makes a study of associations and/or sequences of events. An 
attempt is then made to interpret these in terms of cause and 
effect relationships. 

For a causative factor to be operative in the production of a 
disease it must necessarily precede the occurrence of the disease 
and be associated with the disease. Such a relationship is of no 
practical importance in epidemiology unless it prevails under 
natural conditions. Whether or not this is so in the case of a 
particular factor can only be determined by a study of tbe 
natural conditions under which the disease occurs in man. A 
similar line of reasoning applies in the case of a causative factor 
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for the fatal outcome of a pre-existing disease. This is probably 
the most important principle in the science of epidemiology. 

Many different methods have been used to ascertain these 
critical facts in relation to various diseases. Two methods de¬ 
serve special consideration in this book, these being (1) the 
retrospective or historic method, and (2) the prospective or 
follow-up method. 

Retrospective Studies 

The retrospective method consists of questioning patients 
with a particular disease about their past habits, occupations, 
exposures, family histories, etc, One or more other groups of 
individuals (usually called control groups) are similarly 
questioned. A comparison is then made of the answers given by 
individuals in each of these groups. Many studies of this type 
have now been made in which patients with lung cancer have 
been questioned as to their past habits in relation to smoking. In 
all of these, a very much higher percentage of heavy smokers, 
particularly cigarette smokers, were found among the lung 
cancer patients than among the controls. Certain of these studies 
have ruled out a number of other factors as being important in 
lung cancer or as contributing to the observed association with 
cigarette smoking. Similar studies have shown a relationship be¬ 
tween the use of tobacco and cancer of the buccal cavity. 
These findings are described in detail in another chapter of 
this book. Using the same method, Gertlcr and White 7 found 
an association between cigarette smoking and coronary artery 
disease. 

This method has a number of important advantages. In the 
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THE BIOLOGIC EFFECTS OF TOBACCO 

case of a rare disease it is the only feasible way of studying such 
factors in a large number of patients. It can be confined to 
patients with a well-confirmed diagnosis, and the same doctor 
or group of doctors can review all of the diagnoses. It is suitable 
for the investigation of a great many dillercnt factors at the 
same time. It is relatively inexpensive and can be carried out in 
a relatively short length of time. 

On the other band, the retrospective method is subject to 
several difficulties which may or may not be important in any 
particular study, The first of these is bias, either on the part of 
the patient or the interviewer. The patient, because of his disease, 
is apt to be emotionally upset and perhaps in pain or under the 
influence of drugs at the time lie is questioned. Furthermore, it 
is likely that he has been questioned previously by doctors, with 
the result that certain ideas concerning his disease which he 
would not otherwise have thought of were impressed upon his 
mind. Under these conditions, his answers are apt to be differ¬ 
ent from what they would have been had he been questioned 
prior to the development of the disease. Obviously, this is im¬ 
portant if responses arc incorrect, exaggerated, or minimized. 
It may be equally important if the answers are strictly correct. 
For example, if patients with lung cancer, because of previous 
questioning, are more apt than well persons to recall exposure 
to an insecticide spray, this factor would show up as a difference 
between the control group and the patient group. The inter¬ 
viewer may have preconceived notions about the etiology of 
the disease, in which case he is apt to inadvertently suggest 
answers to the respondent. He may also get misleading informa¬ 
tion if he has a compulsion to identify the cause of each 
individual case of the disease. For example, suppose the inter- 
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viewer believes that there are three causes of epidermoid car¬ 
cinoma of the lung: smoking, certain industrial dusts, and certain 
vapors. Then if the patient says that he is a heavy smoker the 
interviewer may ask only routine questions about the other 
factors; but if he is a nonsmoker the interviewer may question 
him extensively to recall to his mind a forgotten contact with 
one of these other factors. 

These possible biases can be minimized by employing well- 
trained, unprejudiced interviewers and by questioning all patients 
on admission to a hospital or clinic before the final diagnosis has 
been established. The latter procedure, which has been used by 
Levin et al.'" and other investigators, is time-consuming but 
generally increases the reliability of the findings. 

The second major difficulty in retrospective studies lies in 
the selection of the control groups. Insofar as is possible, the 
individuals in the control groups should be matched with patients 
having the disease in question for such factors as age, sex, and 
socioeconomic status. Failure to do so can be a serious error. 
For example, a study might be made of the smoking habits of 
female cancer patients, all of whom were the private patients of 
a famous surgeon; merely because of availability, the control 
group might consist of visitors to the wards of a hospital located 
in a slum area. Differences in smoking habits between the two 
groups might be due to socioeconomic differences between the 
two groups rather than to a true association between cancer and 
smoking. Such obvious errors are avoided in all well-planned 
studies. However, even without this particular type of bias, com¬ 
parative studies of hospital patients with various diseases are 
subject to a selective factor which has been described by Berk- 
sort.- The difficulty can be minimized, and sometimes eliminated, 
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by drawing a matched control sample from the population group 
from which the patients came. This was done in the classic study 
of Lombard and Doering 11 on the relationship between smoking 
habits and cancer of various sites. 

A third possible difficulty is that in some instances a habit 
or condition may influence the probability that a patient with a 
particular disease will come to a hospital (or come to it in a 
physical condition in which he can be questioned). For example, 
smoking might cause a severe cough in a patient with early lung 
cancer who would otherwise have been symptom-free up to the 
terminal stages of the disease. This symptom might bring him 
to the hospital. On the other hand, a heavy smoker with lung 
cancer might attribute his symptoms to smoking and therefore 
fail to seek medical aid up to the terminal stages of the disease. 
A general tendency in either of these two directions would bias 
the results of a retrospective study. 

Prospective Studies 

The prospective or follow-up method consists of questioning 
an extremely large number of apparently well people and then 
dividing them into various groups according to their answers. 
They are then traced for a number of years and the disease in¬ 
cidence and/or mortality rates in various groups are compared. 
The method has the following disadvantages: It is very costly 
and necessarily takes a long time. Because of the number of 
subjects, the questionnaire or interview must be kept short and 
simple. Because of cost and other factors the investigator is 
usually forced to study mortality rather than incidence rates. In 
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some cases the diagnosis of cause of death is uncertain or in 
error. 

On the other hand, the prospective method has a number of 
very important advantages. Since the subjects arc well, or ap¬ 
parently so, at the time of questioning, there is little danger of 
cither respondent or interviewer bias. Human memory is not 
infallible, so a certain number of errors occur in answers to the 
questions, but these errors arc generally random in respect to 
later occurrence of death from a particular disease. Random 
errors reduce associations but do not produce the false appear¬ 
ance of associations. While the questionnaire must be kept short, 
various factors (such as urban and rural differences) can be 
investigated by a proper selection of subjects. The problem of 
selecting a control group does not arise. Information is obtained 
on total mortality by cause of death (which cannot easily be 
done in a retrospective study). This is a tremendous advantage 
if the condition studied is a causative factor in mortality rates 
from two or more diseases. 

Now suppose that a certain factor such as smoking increases 
the probability of early diagnosis and consequent hospitalization 
for a particular disease. As has been noted, in a retrospective 
study of hospital patients this would produce a false impression 
of a positive association between the factor and the disease. The 
same selective factor would have the opposite effect on the early 
findings in a prospective study. Therefore, if the same association 
is found in both retrospective and prospective studies it indi¬ 
cates that the association was not the product of such a relation¬ 
ship. Even if a bias of this type does exist, its influence diminishes 
with time in a prospective study. 
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THE BIOLOGIC EFFECTS OF TOBACCO 

Because of the cost and other difficulties, it is seldom possible 
to embark on a prospective study of a particular habit or condi¬ 
tion unless there is very good reason to suspect that it is im¬ 
portant. For this reason, it is usually best to first investigate 
the matter by the retrospective method and then check the find¬ 
ings by the prospective method if warranted. If both methods 
show the same associations to a statistically significant degree, 
then it is virtually certain that the findings are correct. 

As described in other chapters, the results of two prospective 
studies 3 ' s on smoking in relation to mortality rates have now 
been published. Both of them fully confirm the findings of early 
retrospective studies in respect to an extremely high association 
between lung cancer and cigarette smoking. They have not yet 
been in progress long enough for valid conclusions concerning 
a possible association between lung cancer and pipe and cigar 
smoking. Both studies showed an association between smoking 
habits and death rales from coronary artery disease, the asso¬ 
ciation increasing with tire amount of smoking. In the larger of 
these studies 3 this association applied only to cigarette smoking 
(not to pipe or cigar smoking). Also in the larger of these studies 
a remarkably high degree of association was found between ciga¬ 
rette smoking and total mortality rates. 

Pearl, 1 - in a study of quite different design, found a high de¬ 
gree of association between total mortality rates and smoking 
habits in men. 


Studies of Restricted Population Groups 

One other method of approach should be mentioned briefly 
at this point, ft consists of studying incidence or death rates 
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from a particular disease in very restricted population groups 
such as persons employed in a certain type of factory or mine. 
By a careful analysis of working conditions (including such pro¬ 
cedures as the collection of air samples at nose level of workers 
at certain jobs) and relating these to disease rates, it is often 
possible to identify specific causative factors. This method is 
particularly suitable for studying diseases which are rare in the 
general population. 

Association vs. Causation 

As was said earlier, if one condition is a causative factor in 
the production of another, then the two must be associated. The 
converse of this statement is not necessarily true. Two condi¬ 
tions may be associated with each other only because some third 
condition (or set of conditions) is a causative factor in the pro¬ 
duction of both. For example, the fact that Negroes have dark 
skins and high tuberculosis death rates does not prove that skin 
pigmentation is a factor in the causation of tuberculosis. The 
most difficult problem in epidemiology is to distinguish between 
associations which result from direct causal relationships and 
those which have a different explanation, 

The degree of association between a certain factor and a par¬ 
ticular disease may provide a valuable clue in attempting to 
make the type of distinction just described. Suppose that an 
extremely high association exists only because some third condi¬ 
tion is a causative factor for both the disease and the factor in 
question. Then there must exist an equally high association be¬ 
tween the third condition and the disease as well as between the 
third condition and the factor under consideration. In such cases, 
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THE BIOLOGIC EFFECTS OF TOBACCO 

it is usually not difficult to identify the third condition. An ex¬ 
ample follows. 

Suppose that in a certain city in Utah the age-specific lung 
cancer death rate was found to be ten times as high among 
workers employed in the gas plant as among men in the rest of 
the community. Some occupational exposure (presumably ex¬ 
posure to gas) is the suspected cause of this difference. However, 
it is conceivable that the dilTcrcnce was the result of some factor 
not directly connected with conditions in the plant. For example, 
almost all of the gas company employees might be extremely 
heavy cigarette smokers (being skilled workers of various re¬ 
ligions brought in by the gas company), whereas the other men 
in the city were nonsmokers because they were Mormons. If 
this were the case, such an obvious fact could hardly escape the 
attention of any conscientious investigator. Furthermore, any 
other factor differentiating the employees of the gas company 
from men in the rest of the community in such a way as to make 
a ten-to-one difference in lung cancer death rates would almost 
certainly be equally obvious. If it were found that workers in 
gas plants in many other parts of the country also had extremely 
high lung cancer death rates, any peculiar factor operating out¬ 
side the plants would be even easier to identify. Unless such 
were found, the only reasonable conclusion would be that some 
factor related to employment in a gas plant is a causative factor 
for lung cancer. 

In other words, one of the ways to prove that a causal relation¬ 
ship exists between two variables is to rule out other possible 
explanations for an observed association between them. Gen¬ 
erally speaking, this is not very difficult to determine one way 

184 


CAUSE AND EFFECT 

or the other when the association is very great. It may be very 
difficult if the association is not large. 

Now, there is conclusive evidence of an extremely high asso¬ 
ciation between lung cancer and cigarette smoking habits in 
men. If this is due fo the action of some third factor which is 
causative for both lung cancer and smoking, then that third 
factor should not be. difficult to discover. Although many factors 
have been studied extensively, no factor has been discovered 
which is at all likely to produce lung cancer in men and also to 
produce in them a desire to smoke cigarettes heavily. Unless and 
until such is found or unless strongly negative evidence of some 
other type is discovered, then the only reasonable conclusion is 
tiiat the association is due to cigarette smoking acting as a causa¬ 
tive factor in the production of lung cancer in men. This does not 
mean that smoking is a necessary factor and it may not even be 
a sufficient extrinsic factor for the production of lung cancer. 

The association between cigarette smoking and coronary 
artery disease mortality is not nearly as great as the association 
between cigarette smoking and lung cancer. Therefore, the in¬ 
terpretation of this association is much more dependent upon 
other evidence, which is discussed in another chapter. 

Time Trends and Geographic Studies 

In any field of investigation, the consistency of the evidence 
bearing on a subject influences the degree of confidence one 
has in drawing conclusions. Even evidence which has no in¬ 
terpretable meaning when taken aione should be examined from 
this standpoint. 
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No type of evidence is more difficult to interpret or more 
subject to overinterpretation than time trends. For example, 
there is no doubt that the over-all age-specific death rates in the 
United States and Western Europe have been declining steadily 
for many decades. Most of the decline has been in death rates 
from the infectious and parasitic diseases. There does not appear 
to have been a dramatic rise or fall in age-specific death rates 
for most specific sites of cancer, although this is open to some 
question. There are only two important diseases for which age- 
specific death rates in this country do appear to have risen 
dramatically during the last few decades, these being lung cancer 
and coronary artery disease. In both, the rise has been much 
greater in men than in women. After much dispute on the subject 
it is now generally agreed that most of the apparent rise in lung 
cancer death rates is real (and not merely due to better report¬ 
ing)/ 1 There is still doubt concerning coronary artery disease 
death rates, although I am of the opinion that they have in¬ 
creased considerably. During the same period of time there was 
a rise (until very recently) in the per capita sale of cigarettes in 
the United States and a decline in cigar and pipe tobacco sales. 

What, if anything, do those parallel time trends mean? In a 
positive way, they provide little or no evidence, because a great 
many other factors (some of which may be related to these two 
diseases) have changed during the same period of time.® How¬ 
ever, in a negative way, they do provide some evidence; that is, 
if the death rates from these two diseases had declined while 
cigarette sales had risen, then some doubt would have been cast 
on (he hypothesis that cigarette smoking is an important causa¬ 
tive factor in death rates from these diseases. The same can be 
said of the fall and subsequent rise in cigarette consumption and 
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coronary artery disease death rates in Norway during and after 
World War II. 

Broad geographic studies, such as a comparison between 
death rates in rural vs. urban areas, are subject to much the 
same difficulties in interpretation as time trends. There arc too 
many differences between urban and rural areas to make a 
reasonable interpretation of such associations as are found. 
Again, however, such studies may be useful in testing the con¬ 
sistency of evidence relating to the etiology of disease. One 
simply has to be cautious not to put too much weight on such 
evidence. If an extremely high (or an extremely low) death 
rate for a particular disease is found in some restricted area 
(such as a mining county in Montana), then the situation should 
be carefully investigated. 

Causative Factors Having Multiple Effects 

When the same agent acts as a causative factor in the pro¬ 
duction of two or more different effects, the problem of ascer¬ 
taining cause and clTect relationships becomes more complicated. 
This is particularly true if the several effects are, to some extent 
mutually exclusive. The problem is essentially the same whether 
the experimental or the purely observational method of study is 
employed. Two examples follow. 

Cigarette smoke seems to have a wide variety of effects on 
both men and experimental animals. Under some circumstances, 
these include effects on the nervous system, cardiovascular 
effects, and loss of appetite. The frequency of occurrence, the 
time of occurrence, and the severity of the various effects de¬ 
pend, among other things, upon dosage and individual variation 
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in sensitivity, if mice are continually exposed to an extremely 
high concentration of cigarette smoke, many of the animals die 
within a short time of acute effects. Those that die within a short 
time are most unlikely to develop a disease such as cancer, which 
typically occurs in later life. Esscnbcrg” has reported the results 
of a study in which this was a serious difficulty. If the concen¬ 
tration of smoke is greatly reduced in order to overcome this 
difficulty, it may be reduced below the minimum level for a 
carcinogenic effect — assuming that cigarette smoke could have 
such an effect on these particular animals. 

As has been mentioned, death rates among heavy cigarette 
smokers are much higher than among nonsmokers. At least two 
diseases, and probably more, are involved. The probability of 
eventual death from any one particular disease is reduced to the 
extent that the probability of death from other diseases is in¬ 
creased. This is true regardless of whether cigarette smoking is 
a causative factor in death from each of the diseases for which 
the dcatli rates of smokers are higher than those of nonsmokers. 
The multiplicity of nearly mutually exclusive factors associated 
with heavy cigarette smoking reduces the degree to which any 
single one of them can be associated with cigarette smoking. 

Mutual Causation 

Generally, in dealing with two associated variables, the only 
problem is to ascertain whether one is a causative factor for the 
production of the other. However, sometimes an effect is a causa¬ 
tive factor for the repetition or continuation of the same effect. 
This is true of heat produced by fire and of the end product of 
an autocatalytic chemical reaction. Sometimes each of two vari- 
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ables is a causative factor for the production of the other. This 
sort of situation develops with the use of habit-forming drugs. 
The drug produces physiologic or psychologic effects which 
in turn lead the addict to take the drug again. This may well be 
the case with cigarette smoking. Many people have the Impres¬ 
sion (which may or may not be so) that most heavy cigarette 
smokers are nervous, tense individuals. Because of this asso¬ 
ciation, it has been suggested that nervous tension is a causative 
factor leading to cigarette smoking. The observed association 
(assuming that it is true) could just as well be explained on the 
basis of habitual cigarette smoking being a causative factor in 
the production of nervous tension. Both explanations could be 
correct — and this is by no means an unreasonable hypothesis. 
Considering the difficulty which most heavy smokers experience 
when they attempt to give up the habit, it seems likely that some 
such relationship exists between the causes and the effects of 
cigarette smoking. It could be that certain cardiovascular effects 
of smoking are involved in this relationship. 

The Experimental Approach 

The major difficulty with purely observational studies is that 
a great many associated variables are at play in natural situ¬ 
ations, and this makes it difficult to identify true causative 
factors. The experimental method has some advantages in this 
respect, but it also has some serious disadvantages. Some of these 
will now be discussed. 

The most important feature of the experimental method is the 
control and limitation of variables. In a typical case, the in¬ 
vestigator attempts to hold constant all variables but two. One 
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of these (the independent variable) he deliberately varies and 
then observes the subsequent variation, if any, in the other (the 
dependent variable). Now suppose that he finds a statistically 
significant association between the two variables which is veri¬ 
fied by repetition. A willful act on the part of the investigator 
caused the variation in the independent variable, but other than 
this he knowingly did nothing to cause variation in the dependent 
variable. Therefore, it is usually safe to assume that under the 
conditions of the experiment the independent variable was a 
causative factor in relation to the change in the dependent vari¬ 
able. Of course, this assumes that the experiment was properly 
carried out. 

Tire difficulty lies in the qualifying phrase “under the condi¬ 
tions of the experiment,’' The cause of the eifect on the de¬ 
pendent variable was the totality of conditions sufficient to pro¬ 
duce it. This included many, if not all, of the conditions of the 
experiment, the choice of biologic material (e.g., the experi¬ 
mental animals used), and the degree of variation deliberately 
imposed upon the independent variable. 

If the conditions of the experiment and the degree of variation 
of the independent variable were identical to those applying to 
human beings in natural situations, then there is no problem in 
interpretation. It is rare that this can be achieved in practice. 
For example, in an attempt to determine the immediate effects 
of cigarette smoking, one might measure pulse rates in two com¬ 
parable groups of men, one group smoking lighted cigarettes and 
the other drawing air through unlightcd cigarettes. Differences 
in pulse rates in the two groups might be attributed to the direct 
physiologic effect of nicotine or some other component of 
tobacco smoke. However, the fact that an experiment is being 
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undertaken creates a somewhat artificial situation. The differ¬ 
ence might be due to psychologic factors, since each man 
knows whether he was in the experimental or the control group. 
This is exactly the lype of difficulty which often occurs in obser¬ 
vational studies of naturally occurring conditions (i.e., confusion 
in distinguishing between a causative factor and an associated 
factor which is not causative of the effect in question), and 
it is not necessarily eliminated by the experimental approach. 
However, by a scries of different, well-planned experiments 
(or observational studies) this difficulty can usually be over¬ 
come. 

More often than not, experimental conditions are very remote 
indeed from naturally occurring situations applicable to man. 
Then, even though cause and effect relationships may be estab¬ 
lished, there is no guarantee whatsoever that the findings are 
applicable to man in natural situations. A few examples are 
sufficient to illustrate this point. 

Cigarette smoke bubbled through a culture of certain patho¬ 
genic bacteria in a test tube will kill the bacteria. 0 This does not 
prove that smoking a cigarette will kill bacteria in the human 
body. However, other experiments have shown that smoking 
does kill or limit the growth of some types of bacteria in the 
mouth. 1 This suggests that smoking might reduce the probability 
that an individual will develop an infectious respiratory disease. 
The fact that death rates from infectious respiratory disease have 
rapidly decreased as the per capita consumption of cigarettes 
has increased is consistent with this hypothesis. However, all of 
this evidence taken together does not constitute proof of the 
hypothesis. Now suppose that it could be shown that the inci¬ 
dence mtes of infectious respiratory diseases are much lower 
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among smokers than among nonsmokers. The experimental find¬ 
ings would offer a reasonable explanation for the association. 
This would come dose to proving the hypothesis, although one 
would probably want a bit more evidence before drawing abso¬ 
lutely positive conclusions. 

A certain chemical substance is a causative factor in carcino¬ 
genesis when applied to the skin of a particular inbred strain of 
mice; but it has little or no carcinogenic elTect on certain other 
inbred strains of mice, or certain other species of animals. There¬ 
fore, lacking other evidence, there is no assurance that the 
substance is carcinogenic for man. Carcinogenesis involving 
chemical substances may, and in some instances is known to, 
depend upon a complex combination of factors, both intrinsic 
and extrinsic. I have little doubt that, given enough time, a clever 
investigator could create a set of conditions in which a wide 
variety of ordinarily harmless substances could be shown to play 
a role in carcinogenesis. 

Negative results of experiments may have little or no meaning 
in relation to naturally occurring events. Often the results are 
negative simply because the investigator failed to establish the 
conditions under which a certain factor plays a role in the causa¬ 
tion of the effect in question. In 1775, Percival Pott 13 showed 
that cancer of the scrotum, which is a very rare disease, occurred 
with considerable frequency in chimney sweeps. The association 
was so strong, and other explanations so unlikely, that this was 
interpreted to mean that coal soot under certain conditions is a 
causative factor in the production of human cancer (at least 
cancer of this specific site). However, it was not until 1915 that 
Yamagiwa and Itchikawa ,r ’ succeeded in producing cancer ex¬ 
perimentally with soot. Previous experimenters had failed to 
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apply the soot for a long enough period of time or had used 
animals which were not susceptible. 

For these reasons, I am of the opinion that negative experi¬ 
mental results do not necessarily constitute negative evidence 
concerning cause and effect relationships in an association ob¬ 
served in natural situations. This is particularly true if the ex¬ 
perimental conditions are remote from natural situations. 

In some instances it is possible to obtain direct and conclusive 
experimental evidence that a certain naturally occurring event 
is a causative factor in the production of a disease in man. This 
was true, for example, of human experiments showing that 
mosquitoes transmit yellow fever. Likewise, the effect of smok¬ 
ing on patients with thromboangiitis obliterans has been con¬ 
vincingly demonstrated, as described in another chapter. Gen¬ 
erally speaking, the best possible proof can be obtained in those 
instances where it is possible to use man as the experimental 
animal and at the same time closely reproduce naturally oc¬ 
curring situations. However, this is seldom feasible in the case 
of diseases with high fatality rates. It is even less feasible in cases 
where an agent is suspected as being a causative factor not in 
the production of disease but in death from pre-existing disease. 
In such instances, well-planned observational studies can pro¬ 
vide evidence which makes it extremely probable that a cause 
and effect relationship exists. This interpretation is greatly re¬ 
inforced if all other known facts are consistent with the hypoth¬ 
esis, if experimental evidence Indicates that the hypothesis is 
at least reasonable, and if no appreciable evidence can be found 
in support of any reasonable alternative hypothesis. Perhaps it 
is never possible to prove with absolute certainty that a naturally 
occurring event was the product of a specified set of naturally 
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occurring conditions. However, such can often be established 
beyond reasonable doubt. In human alTnirs, important de¬ 
cisions must necessarily be based upon the preponderance of 
evidence. 


Summary 

A cause of a disease has been defined as a totality of condi¬ 
tions that is sufficient to produce it. A causative factor has been 
defined as any one of the conditions which taken together make 
up a cause. A causative factor may not be necessary for the 
production of a disease, but the presence of a potential causative 
factor increases the probability that the disease will occur in a 
given situation. In epidemiologic research the problem is to 
discover causative factors which increase the probability of 
human disease or death and are operative and important in 
naturally occurring conditions to which human beings are ex¬ 
posed. This can only be determined by a study of the natural 
conditions under which a disease occurs in man. The advantages 
and shortcomings of the retrospective method and the pro¬ 
spective method of study have been described in some detail. 
Such studies reveal associations. If one variable is a causative 
factor in the production of the other, then the two factors must 
be associated; but the converse is not necessarily true. Supple¬ 
mentary evidence must be studied for consistency and to rule 
out other possible explanations before an observed association 
can be interpreted as indicating a direct cause and effect relation¬ 
ship. This is usually not too difficult to determine if the associ¬ 
ation is very high. It is more difficult if the association is low. 
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The experimental approach can supply valuable supplementary 
information but has definite limitations in most instances. 
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